
[PyTorch] Add dtype information to QuantizedTensorStorage class #2676

Open

ptrendx wants to merge 5 commits into NVIDIA:main from ptrendx:pr_dtype_in_storage

Conversation


@ptrendx ptrendx commented Feb 12, 2026

Description

This PR adds the fake dtype information to the QuantizedTensorStorage class. This eliminates the need to guess the correct dtype for dequantization, as was previously done in distributed.py, and it eliminates the unintentional dequantization to FP32 when calling dequantize() on the Storage class with no dtype argument.

Type of change

  • Documentation change (change only to the documentation, either a fix or new content)
  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Infra/Build change
  • Code refactoring

Changes

Please list the changes introduced in this PR:

  • Added the _dtype field to the QuantizedTensorStorage class
  • Modified the dequantize call to use that new field when calling dequantize with no arguments
  • Removed guessing of the dtype from distributed.py
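The effect of the changes above on default dequantization can be sketched in a minimal, self-contained illustration. The class below is a hypothetical stand-in (the real QuantizedTensorStorage holds quantized payloads, scales, and a quantizer), assuming only that `_dtype` stores the fake high-precision dtype and that `dequantize()` falls back to it:

```python
import torch

class QuantizedTensorStorageSketch:
    """Minimal stand-in for QuantizedTensorStorage; names are illustrative."""

    def __init__(self, data: torch.Tensor, fake_dtype: torch.dtype):
        self._data = data          # stands in for the quantized payload
        self._dtype = fake_dtype   # the new field added by this PR

    def dequantize(self, *, dtype=None):
        # Before this PR the implicit fallback was FP32; now the stored
        # "fake" high-precision dtype is used when no dtype is given.
        if dtype is None:
            dtype = self._dtype
        return self._data.to(dtype)

storage = QuantizedTensorStorageSketch(torch.zeros(4), fake_dtype=torch.bfloat16)
print(storage.dequantize().dtype)  # torch.bfloat16 rather than torch.float32
```

An explicit dtype argument still wins over the stored field, so existing call sites that pass a dtype are unaffected.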

Checklist:

  • I have read and followed the contributing guidelines
  • The functionality is complete
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes

Signed-off-by: Przemek Tredak <ptredak@nvidia.com>
@ptrendx ptrendx requested a review from timmoon10 February 12, 2026 19:07

greptile-apps bot commented Feb 12, 2026

Greptile Summary

This PR adds a _dtype field (the "fake" high-precision dtype) to every QuantizedTensorStorage subclass and threads it through constructors, get_metadata(), and dequantize() defaults. This cleanly removes the hardcoded torch.bfloat16 guesses in distributed.py and ensures that dequantize() called without arguments no longer silently degrades to FP32.

Key changes and concerns:

  • fake_dtype validation gate in QuantizedTensor.__new__ — The guard if fake_dtype is not None and fake_dtype != dtype: raise ValueError(...) (line 376–377 of quantized_tensor.py) is logically correct in isolation, but since get_metadata() now always emits fake_dtype=self._dtype, any code that calls make_like(tensor, dtype=new_dtype) with a different dtype will immediately hit this ValueError. There are 10+ existing call sites in the codebase (attention/dot_product_attention/context_parallel.py, utils.py, tensor/__init__.py) that pass an explicit dtype to make_like, and the module-level cast_to_dtype helper (used by model.half() / model.bfloat16()) iterates over all QuantizedTensor parameters using exactly this pattern. This is a regression that will surface in normal training workflows.
  • No backward-compatibility guard for old pickled storage objects — The new _dtype field is only set in __new__; there is no hasattr fallback in dequantize() (unlike the dtype property on QuantizedTensor). Unpickling a *TensorStorage that was saved before this PR will result in AttributeError: _dtype on the first dequantize call.
  • Storage classes correctly updated — All four storage classes (Float8TensorStorage, Float8BlockwiseQTensorStorage, MXFP8TensorStorage, NVFP4TensorStorage) follow a consistent pattern and correctly dispatch between storage-only (object.__new__) and full-tensor (super().__new__) paths.

Confidence Score: 2/5

  • Not safe to merge as-is — the fake_dtype validation in QuantizedTensor.__new__ will break existing make_like(dtype=X) call sites throughout the attention module and the module-level dtype-cast utility.
  • The core idea (storing the high-precision dtype on storage objects) is sound and the distributed.py and C++ changes are clean improvements. However, the new validation guard in QuantizedTensor.__new__ combined with fake_dtype being unconditionally included in get_metadata() creates a regression: any call to make_like or to_dtype that changes the nominal dtype — including the widely-used cast_to_dtype module helper — will now raise a ValueError. This is a blocking correctness issue for normal training workflows.
  • transformer_engine/pytorch/quantized_tensor.py — the fake_dtype validation logic and its interaction with get_metadata() requires the most attention before merging.

Important Files Changed

Filename Overview
transformer_engine/pytorch/quantized_tensor.py Adds _dtype annotation to QuantizedTensorStorage and a fake_dtype parameter to QuantizedTensor.__new__. The new validation guard (fake_dtype != dtype → ValueError) is correct in isolation but breaks all make_like(dtype=X) call sites where X differs from the tensor's current _dtype, because get_metadata() now injects fake_dtype matching the old dtype.
transformer_engine/pytorch/tensor/storage/float8_tensor_storage.py Adds fake_dtype parameter to __new__, propagates it via super().__new__ for the full-tensor path, stores it as _dtype for the storage-only path, and adds it to get_metadata(). Logic is correct; view() and get_metadata() now preserve _dtype correctly.
transformer_engine/pytorch/tensor/storage/nvfp4_tensor_storage.py Previously always called super().__new__() regardless of cls, corrected to the same storage-vs-full-tensor dispatch pattern as other storage classes. fake_dtype propagated correctly throughout.
transformer_engine/pytorch/distributed.py Removes three hardcoded dtype = torch.bfloat16 guesses and replaces them with dtype = inp._dtype, which is now reliably set on all storage objects produced by this PR. Clean improvement.
transformer_engine/pytorch/csrc/quantizer.cpp Adds kwargs["fake_dtype"] = GetATenDType(dtype) to all five create_tensor call sites in the internal (storage-only) C++ path. The full-tensor path already passes dtype as a top-level constructor arg so no change needed there.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A["C++ Quantizer::create_tensor(dtype)"] -->|"internal=true"| B["Float8TensorStorage.__new__\n(fake_dtype=GetATenDType(dtype))"]
    A -->|"internal=false"| C["Float8Tensor.__new__\n(dtype=GetATenDType(dtype))"]

    B -->|"cls is Storage"| D["object.__new__()\ninstance._dtype = fake_dtype"]
    B -->|"cls is Tensor subclass"| E["super().__new__(cls, fake_dtype=fake_dtype)"]
    C --> E

    E --> F["QuantizedTensor.__new__(dtype, fake_dtype)\nValidate: fake_dtype == dtype if not None\ninstance._dtype = dtype"]

    F --> G["QuantizedTensor._dtype set"]
    D --> G

    G -->|"dequantize() called"| H{"dtype arg?"}
    H -->|"None"| I["use self._dtype"]
    H -->|"explicit"| J["use explicit dtype"]
    I --> K["dequantize to correct high-precision dtype"]
    J --> K

    style D fill:#90EE90
    style G fill:#90EE90
    style F fill:#FFB6C1,stroke:#FF0000
    note1["⚠️ Validation fails for\nmake_like(dtype=X) when X != _dtype"]
    F -.-> note1

Last reviewed commit: 369f8b5

@greptile-apps greptile-apps bot left a comment

13 files reviewed, no comments


ptrendx commented Feb 12, 2026

/te-ci pytorch

ksivaman previously approved these changes Feb 12, 2026

@ksivaman ksivaman left a comment
LGTM

timmoon10 previously approved these changes Feb 14, 2026

@timmoon10 timmoon10 left a comment
Overall this is a big improvement. I have some naming nits.

shape: Iterable[int],
dtype: torch.dtype,
*,
fake_dtype: Optional[torch.dtype] = None,
Collaborator
Isn't this redundant with the dtype kwarg?

Member Author
This is mostly to avoid issues with MRO and still have fairly straightforward constructors for the Storage classes.

Member Author
Also, I just noticed that the make_like call would otherwise be problematic there: we want to include the fake_dtype in the get_metadata call, but if it were named dtype it would clash with the dtype that we pass directly in make_like.
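The keyword clash described above can be reproduced in plain Python. The `Dummy` class and string values below are hypothetical stand-ins for a QuantizedTensor constructor and torch dtypes:

```python
class Dummy:
    """Hypothetical constructor taking both a dtype and a fake_dtype kwarg."""

    def __init__(self, *, dtype=None, fake_dtype=None):
        self.dtype = dtype
        self.fake_dtype = fake_dtype

# Metadata harvested from an existing tensor, as get_metadata would emit it.
metadata = {"fake_dtype": "bfloat16"}

# make_like-style call: an explicit dtype coexists with the metadata kwargs.
obj = Dummy(dtype="float16", **metadata)  # fine

# Had the metadata key been named "dtype", the same call pattern clashes:
try:
    Dummy(dtype="float16", **{"dtype": "bfloat16"})
except TypeError as exc:
    print(exc)  # got multiple values for keyword argument 'dtype'
```

This is why keeping a distinct name in the metadata dict avoids a TypeError at every make_like call site that passes dtype explicitly.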

data: Optional[torch.Tensor],
fp8_scale_inv: torch.Tensor,
fp8_dtype: TE_DType,
fake_dtype: Optional[torch.dtype] = None,
Collaborator
I'd prefer to just name it dtype since QuantizedTensor is already using that name in its constructor.

Suggested change
fake_dtype: Optional[torch.dtype] = None,
dtype: Optional[torch.dtype] = None,

Signed-off-by: Przemek Tredak <ptredak@nvidia.com>
@ptrendx ptrendx dismissed stale reviews from timmoon10 and ksivaman via be723b2 February 18, 2026 01:40
@greptile-apps greptile-apps bot left a comment

13 files reviewed, no comments

timmoon10 previously approved these changes Feb 19, 2026

@timmoon10 timmoon10 left a comment
Still not a fan of fake_dtype, but approving to unblock.


ptrendx commented Feb 24, 2026

/te-ci pytorch

Signed-off-by: Przemek Tredak <ptredak@nvidia.com>

ptrendx commented Mar 4, 2026

/te-ci pytorch

Comment on lines +376 to +377
if fake_dtype is not None and fake_dtype != dtype:
raise ValueError(f"fake_dtype ({fake_dtype}) does not match dtype ({dtype})")
Contributor
Validation breaks existing make_like call sites

This new guard will cause regressions on every call to make_like(tensor, dtype=X) where X differs from tensor._dtype, because get_metadata() now always injects fake_dtype=self._dtype into kwargs, and QuantizedTensor.__new__ is then called with both dtype=X (the intended new dtype) and fake_dtype=old_dtype (from metadata).

Confirmed breakage paths:

  • transformer_engine/pytorch/tensor/__init__.py:63 — module cast utility (model.half(), model.bfloat16(), etc.) calls tensor.__class__.make_like(tensor, dtype=dtype) for every QuantizedTensor; whenever dtype != tensor._dtype the model cast will raise ValueError.
  • attention/dot_product_attention/context_parallel.py — 10+ call sites of the form Float8Tensor.make_like(x, data=y, dtype=fwd_nominal_dtype) where fwd_nominal_dtype may differ from x._dtype.
  • attention/dot_product_attention/utils.py:2220 — same pattern.

The root cause is that fake_dtype is being included in get_metadata() but the constructor-level guard then rejects any case where the caller wants to create a clone at a different nominal dtype. Either:

  1. Remove the guard (it is redundant for the full-tensor path, because QuantizedTensor.__new__ already sets _dtype = dtype), or
  2. Override fake_dtype in QuantizedTensor.make_like so it matches the requested dtype before calling the constructor.
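Option 2 could look roughly like the following self-contained sketch. The class, string dtype values, and method bodies are hypothetical simplifications (not the actual Transformer Engine code), assuming only the guard quoted above and a get_metadata() that always emits fake_dtype:

```python
class QuantizedTensorSketch:
    """Minimal stand-in reproducing the guard and the proposed fix."""

    def __init__(self, *, dtype, fake_dtype=None):
        # The guard added at quantized_tensor.py lines 376-377.
        if fake_dtype is not None and fake_dtype != dtype:
            raise ValueError(f"fake_dtype ({fake_dtype}) does not match dtype ({dtype})")
        self._dtype = dtype

    def get_metadata(self):
        # Mirrors the PR: fake_dtype is always emitted from metadata.
        return {"fake_dtype": self._dtype}

    @classmethod
    def make_like(cls, tensor, *, dtype=None, **kwargs):
        ctor_kwargs = tensor.get_metadata()
        if dtype is None:
            dtype = tensor._dtype
        # Fix option 2: override the harvested fake_dtype so it always
        # matches the requested dtype before the guard runs.
        ctor_kwargs["fake_dtype"] = dtype
        ctor_kwargs.update(kwargs)
        return cls(dtype=dtype, **ctor_kwargs)

t = QuantizedTensorSketch(dtype="bfloat16")
clone = QuantizedTensorSketch.make_like(t, dtype="float16")  # no ValueError
```

Without the override line, the harvested fake_dtype="bfloat16" would reach the constructor alongside dtype="float16" and trip the guard, which is exactly the regression described above.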

Comment on lines +40 to 41
_dtype: torch.dtype
_quantizer: Optional[Quantizer]
Contributor
No lazy-init guard for _dtype on storage objects

QuantizedTensor.dtype (line 405–409) has a hasattr(self, "_dtype") lazy-initializer that protects against deserialization from pre-PR checkpoints. QuantizedTensorStorage and its subclasses have no equivalent protection — _dtype: torch.dtype is only a class-level annotation, not a default value.

If an *TensorStorage object is unpickled from a checkpoint that was saved before this PR, the first call to .dequantize() (or the distributed-ops in distributed.py that now access inp._dtype) will raise AttributeError: _dtype.

Consider adding a similar lazy fallback in the dequantize methods, e.g.:

if dtype is None:
    dtype = getattr(self, "_dtype", torch.float32)
